Ibots Learn Genuine Team Solutions

Authors

  • Cristina Versino
  • Luca Maria Gambardella
Abstract

"Ibots" (Integrating roBOTS) is a computer experiment in group learning. It is designed to understand how reinforcement learning can be used to automatically program a team of robots with a shared mission. Moreover, we are interested in deriving genuine team solutions. These are policies whose form strongly depends on the number of robots composing the team, on their individual skills and weaknesses, and on any other mission boundary condition that makes certain solutions preferable to others "at a team level". The Ibots learn to accomplish the integration mission by means of a reinforcement signal which measures their performance as a team. This form of payoff leads to genuine team solutions. Benefits and drawbacks of using a single team payoff as opposed to individual robot payoffs are discussed.
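The distinction the abstract draws between a single team payoff and individual robot payoffs can be illustrated with a toy sketch. This is not the Ibots experiment itself: the two-robot, two-target task, the payoff values, and all function names below are hypothetical, chosen only to show how the reward signal shapes what independent learners converge to.

```python
import random

def team_payoff(actions):
    # Hypothetical team signal: every robot receives the same reward,
    # 1.0 only when the robots cover distinct targets.
    r = 1.0 if len(set(actions)) == len(actions) else 0.0
    return [r] * len(actions)

def individual_payoff(actions):
    # Hypothetical individual signal: each robot is paid for its own
    # target only; target 0 is worth more than target 1.
    values = [1.0, 0.6]
    return [values[a] for a in actions]

def train(payoff, n_robots=2, n_actions=2, episodes=5000,
          alpha=0.1, epsilon=0.1, seed=0):
    """Independent epsilon-greedy learners on a one-step task."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_robots)]
    for _ in range(episodes):
        actions = []
        for i in range(n_robots):
            if rng.random() < epsilon:
                actions.append(rng.randrange(n_actions))  # explore
            else:
                actions.append(max(range(n_actions), key=lambda a: q[i][a]))
        rewards = payoff(actions)
        for i, (a, r) in enumerate(zip(actions, rewards)):
            q[i][a] += alpha * (r - q[i][a])  # running-average value update
    # Greedy action of each robot after learning.
    return [max(range(n_actions), key=lambda a: q[i][a]) for i in range(n_robots)]

if __name__ == "__main__":
    print("team payoff:      ", train(team_payoff))
    print("individual payoff:", train(individual_payoff))
```

In this sketch the shared signal pushes the two learners toward different targets, while the individual signal lets both settle on the same higher-valued target. That is one simple way to see why a team-level payoff can produce solutions that depend on the composition of the team.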

Similar resources

Ibots: Learning Real Team Solutions

This paper presents "Ibots" (Integrating roBOTS), a computer experiment in team robotics designed on an artificial mission. Our aim is to understand how to use reinforcement learning to program automatically a team of robots with a shared mission. Moreover, we are interested in learning real team solutions. These are programs whose form strongly depends on the number of robots composing the team...

Learning Real Team Solutions

This paper presents "Ibots" (Integrating roBOTS), a computer experiment in group learning designed on an artificial mission. By this experiment, our aim is to understand how to use reinforcement learning to program automatically a team of robots with a shared mission. Moreover, we are interested in learning real team solutions. These are programs whose form strongly depends on the number of robo...

Team formation and biased self-attribution

There exists extensive evidence that people learn positively about themselves. We build on this finding to develop a model of team formation in the workplace. We show that learning positively about oneself systematically undermines the formation of teams. Agents becoming overconfident tend to ask for an excessive share of the group outcome. Positive learning generates divergence in workers' bel...

Combination of Experimental Design and Desirability Function as a Genuine Method to Achieve Common Optimal Conditions for the Adsorption of Pb(II) and Cu(II) onto the Poplar Tree Leaves: Equilibrium, Kinetic and Thermodynamic Studies

In this study, the ashes of poplar tree leaves are applied as an efficient, accessible and inexpensive biosorbent for the removal of the heavy metals Pb2+ and Cu2+ from aqueous solutions. In the adsorption processes, the success of ion removal highly depends on the level of several experimental factors such as pH, contact time, adsorbent dosage and temperature. Therefore, a genuine statistical e...

Optimal non-adaptive solutions for the counterfeit coin problem

We give optimal solutions to all versions of the popular counterfeit coin problem obtained by varying whether (i) we know if the counterfeit coin is heavier or lighter than the genuine ones, (ii) we know if the counterfeit coin exists, (iii) we have access to additional genuine coins, and (iv) we need to determine if the counterfeit coin is heavier or lighter than the genuine ones. Moreover, ou...

Publication date: 1997